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We claim: 

1 . A method of determining the nucleotide seqy&ice at an end of a polynucleotide, the 
hod comprising the steps of: 

ligating one or more encoded adaptors/ft) an end of the polynucleotide, each encoded 
aptor having an oligonucleotide tag selected from a minimally cross-hybridizing set of 
igonucleotides and a protruding stranjKcompIementary to a portion of a strand of the 
polynucleotide; and 

identifying one or more mlcleotides in each of said portions of the strand of the 
polynucleotide by specifically/hybridizing a ta 8 complement to each oligonucleotide tag of the 
one or more encoded adapters ligated thereto. 



2. The method of claim 1 wherein said step of ligating includes ligating a plurality of 
1 5 different encoded adaptors to said end of said polynucleotide such that said protruding strands 
of the plurality of different encoded adapators are complementary to a plurality of different 
portions of said strand of said polynucleotide such that there is a one-to-one correspondence 
between said different encoded adaptors and the different portions of said strand. 

20 3. The method of claim 2 wherein said different portions of said strand of said 
polynucleotide are contiguous. 
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4. The method of claim 3 wherein said protruding strand of said encoded adaptors 
contains from 2 to 6 nucleotides and wherein said step of identifying includes specifically 
hybridizing said tag complements to said oligonucleotide tags such that the identity of each 
nucleotide in said portions of said polynucleotide is determined successively. 
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5. The method of claim 4 wherein said step of identifying further includes providing a 
number of sets of tag complements equivalent to the number of nucleotides to be identified in 
said portions of said polynucleotide. 



35 



6. The method of claim 5 wherein said step of identifying further includes providing said 

tag complements in each of said sets that are capable of indicating the presence of a 
predetermined nucleotide by a signal generated by a fluorescent signal generating moiety, 
there being a different fluorescent signal generating moiety for each kind of nucleotide. 



7. The method of claim 5 wherein said oligonucleotide tags of said encoded adaptors are 

single stranded and said tag complements to said oligonucleotide tags are single stranded such 
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that specific hybridization between an oligonucleotide tag and its respective tag complement 
occurs through Watson-Crick base pairing. 

8. The method of claim 7 wherein said encoded adaptors have the form: 



5'-p(N) n (N ) r (N ) S (N ) q (N ) t -3' 
z(N») r (N') s (N') q -5' 



or 



. Stp P(N ) r (N ) S (N ) q (N ) t -3' 

0 Y 3 > -z(N) n (H') r W) a iti t ) q -S' 
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where N is a nucleotide and N' is its complement, p is a phosphate group, z is a 3' hydroxy I or 
a 3' blocking group, n is an integer between 2 and 6, inclusive, r is an integer between 0 and 
15 18, inclusive, s is an integer which is either between four and six, inclusive, whenever the 
encoded adaptor has a nuclease recognition site or is 0 whenever there is no nuclease 
recognition site, q is an integer greater than or equal to 0, and t is an integer greater than or 
*P equal to 8. 

Uj 20 9. The method of claim 8 wherein r is between 0 and 12, inclusive, t is an interger 

between 8 and 20, inclusive, and z is a phosphate group. 
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10. The method of claim 9 wherein members of said minimally cross-hybridizing set differ 
from every other member by at least six nucleotides. 

1 1 . The method of claim 5 wherein said oligonucleotide tags of said encoded adaptors are 
double stranded and said tag complements to said oligonucleotide tags are single stranded such 
that specific hybridization between an oligonucleotide tag and its respective tag complement 
occurs through the format ion of a Hoogsteen or reverse Hoogsteen triplex. 

12. The method of claim 1 1 wherein said encoded adaptors have the form: 



5'-p(N) n (N ) r (N ) S (N ) q (N ) t -3* 
z (N 1 ) r (N 1 ) s (N 1 ) q (N ) t -5' 

35 or 

_ / C\ p(N ) r (N ) S (N ) q (N ) l -3' 

^\)\9S 3 , -z(N) n (N , ) r (N , ) s (N , ) q (N ) t -5' 

40 where N is a nucleotide and N' is its complement, p is a phosphate group, z is a 3' hydroxyl or 
a 3 1 blocking group, n is an integer between 2 and 6, inclusive, r is an integer between 0 and 
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IK, inclusive, s is an integer which is either between four and six, inclusive, whenever the 
encoded adaptor has a nuclease recognition site or is 0 whenever there is no nuclease 
recognition site, q is an integer greater than or equal to 0, and t is an integer greater than or 
equal to 8. 

13. The method of claim 12 wherein r is between 0 and 12, inclusive, t is an interger 
between 8 and 24, inclusive, and z is a phosphate group. 

14. The method of claim 13 wherein members of said minimally cross-hybridizing set 
differ from every other member by at least six nucleotides. 

15. A method of determining the nucleotide sequences of a plurality of polynucleotides, 
the method comprising the steps of: 

(a) attaching a first oligonucleotide tat/ from a repertoire of tags to each 
Wrtide in a population of polynucleotides such that each first oligonucleotide tag 

repertoire is selected from a firstyminimally cross-hybridizing set; 

(b) sampling the population of nolynucleotides to form a sample of polynucleotides 
sucli/thal substantially all different polynucleotides in the sample have different first 
oligonucleotide tags attached; 

(c) sorting the polynucleotides of the sample by specifically hybridizing the first 
(igonucleotide tags with their respective complements, the respective complements being 
ttached as uniform populations of substantially identical oligonucleotides in spatially 

discrete regions on the one o/ more solid phase supports; 

(d) ligating one orifiore encoded adaptors to an end of the polynucleotides in the 
sample, each encoded adaptor having a second oligonucleotide tag selected from a second 
minimally cross-hybridizing set and a protruding strand complementary to a protruding 
strand of a polynuclatnide of the population; and 

^ (e) identifying a plurality of nucleotides in said protruding strands of the 
polynucleotides by specifically hybridizing a tag complement to each second oligonucleotide 
tag of the one or more encoded adaptors. 



1 6. The method of claim 1 5 further including the steps of (f) cleaving said encoded 
adaptors from said polynucleotides with a nuclease having a nuclease recognition site separate 
from its cleavage site so that a new protruding strand is formed on said end of each of said 
polynucleotides, and (g) repeating steps (d) through (f). 

1 7. A method of identifying a population of mlWA molecules, the method comprising 
le steps of: 
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(a) forming a population of cDNA molecules from the population of mRNA 
molecules such that each cDNA molecule has a first oligonucleotide tag attached, the first 
oligonucleotide tags being selected from a first minimallv/Cross-hybridizing set; 

(b) sampling the population of cDNA molecules to form a sample of cDNA 
molecules such that substantially all different cDNA/molecules have different first 
oligonucleotide tags attached; 

(c) sorting the cDNA molecules by specifically hybridizing the first oligonucleotide 
tagsf with their respective complements, the r^pective complements being attached as 
uniform populations of substantially identical complements in spatially discrete regions on 
o^ie or more solid phase supports; 

(d) ligating one or more encodfed adaptors to an end of the cDNA molecules in the 
population, each encoded adaptor having a second oligonucleotide tag selected from a 
second minimally cross-hybridizing set and a protruding strand complementary to a 
protruding strand of a cDNA molecule of the sample; and 

(e) determining the Identity and ordering of a plurality of nucleotides in each of said 
protruding strands of the cDNA molecules by specifically hybridizing a tag complement to 
each second oligonucleotide tag of the one or more encoded adaptors; 

wherein the txopulation of mRNA molecules is identified by the frequency 
distribution of the portions of sequences of the cDNA molecules. 

1 8. The method of claim 17 further including the steps of (f) cleaving said encoded 
adaptors from said polynucleotides with a nuclease having a nuclease recognition site separate 
from its cleavage site so that a new protruding strand is formed on said end of each of said 
cDNA molecules, and (g) repeating steps (d) through (f)- 

1 9. A method of determining the nucleotide Sequence at an end of a polynucleotide, the 
method comprising the steps of: 

(a) ligating an encoded adaptor to ad end of the polynucleotide, the encoded adaptor 
ing an oligonucleotide tag selected from a minimally cross-hybridizing set of 

igonucleotides and a protruding stnina complementary to a portion of a strand of the 
blynucleotide; 

(b) identifying one or moj^e nucleotides in the portion of the strand of the 
polynucleotide by specifically hybridizing a tag complement to the oligonucleotide tag of the 
encoded adaptor ligated thereto; 

(c) cleaving the en/oded adaptor from the end of the polynucleotide with a nuclease 
having a nuclease recognition site separate from its cleavage site so that a new protruding 
strand is formed at the/end of the polynucleotide; and 

(d) repeating steps (a) through (c). 
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The method of claim 19 wherein said protruding strand of said encoded adaptor 



contains from 2 to 6 nucleotides and wherein step of identifying includes specifically 
hybridizing successive said tag complements to said oligonucleotide tag such that the identity 
of each nucleotide in said portion of said polynucleotide is determined successively. 

2 1 . The method of claim 20 wherein said step of identifying further includes providing a 
number of sets of tag complements equivalent to the number of nucleotides to be identified in 
said portion of said polynucleotide. 

22. The method of claim 21 wherein said step of identifying further includes providing 
said tag complements in each of said sets that are capable of indicating the presence of a 
predetermined nucleotide by a signal generated by a fluorescent signal generating moiety, 
there being a different fluorescent signal generating moiety for each kind of nucleotide. 

23. The method of claim 22 wherein said oligonucleotide tags of said encoded adaptors 
are single stranded and said tag complements to said oligonucleotide tags are single stranded 
such that specific hybridization between an oligonucleotide tag and its respective tag 
complement occurs through Watson-Crick base pairing. 



24. A composition of matter comprising a douhle stranded oligonucleotide adaptor having 
the form: / 



here N is a nucleotide and N* \fc its complement, p is a phosphate group, z is a 3* hydroxy I or 
3' blocking group, n is an integer between 2 and 6, inclusive, r is an integer between 0 and 
8, inclusive, s is an integer which is either between four and six, inclusive, whenever the 
decoded adaptor has a nuclease recognition site or is 0 whenever there is no nuclease 
recognition site, q is an integer greater than or equal to 0, t is an integer greater than or equal 
to 8. / 




5'-p<N) n (N ) r (J/f ) S (N ) (N ) t -3' 
z<N») r /N') s (N') q -5' 



P(yN >r< N >s< N >q(N ) t -3 f 
3 , -z(N) r) /N , ) r (N , ) s (N , ) q -5 l 




* 



5$ 



The composition of claim 24 wherein/f is between 0 and 12, inclusive, t is an interger 
between 8 and 20, inclusive, z is a phosphate group, and said single stranded moiety (N) t is a 
/ member of a minimally cross-hybridizijffg set. 

26. The composition of claim 25 wherein n equals 4 and wherein members of said 
minimally cross-hybridizing set differ from every other member by at least six nucleotides. 
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27. A composition of matter comprising a double stranded oligonucleotide adaptor having 
the form: 



5'-p<N) n <N ) r (N ) S (N ) ( 



)t-3' 



P(N ) r < N >/< N )q(N ) t -3' 
3'-z (N) n (N» ) r (N"/ S (N» ) q (N' ) t -5 f 



N is a nucleotide and NT is its complement, p is a phosphate group, z is a 3* hydroxy! or 
ocking group, n is an integer between 2 and 6, inclusive, r is an integer between 0 and 
1 8, inclusive, s is an integer which is j^ither between four and six, inclusive, whenever the 
encoded adaptor has a nuclease recognition site or is 0 whenever there is no nuclease 
recognition site, q is an integer greater than or equal to 0, and t is an integer greater than or 
equjal to 8. 

The composition/61' claim 27 wherein r is between 0 and 12, inclusive, t is an interger 
between 8 and 24, inclusive, z is a phosphate group, and said double stranded moiety 

-<N ) t 
-(N') t 

is a member of a minimally cross-hybridizing set. 



29. The composition of claim 28 wherein members of said minimally cross-hybridizing set 
differ from every other member by at least six nucleotides. 
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